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A NOVEL DISINTCGRTN METALLOPROTBASE AND METHODS OF USE 



Field ofthe inventioi^ 

The invention relates to a novel protein, its fragments and mutants and to its use 
in detecting and testing drugs for ailments, including osteoarthritis and others 
characterized by up regulation of metalloproteases. 
Background 

A number of enzymes eflfect the breakdown of structural proteins and are 
structurally related metaUoproteases. These include human skin fibroblast coUagenase. 
human skin fibroblast gelatinase. human sputum collagenase and gelatinase, and humail 
stromelysin. These are zinc^ntaining metaUoprotease enzymes, as are the angiotensin- 
converting enzymes and the enkephalinases. Collagenase, stromelysin and related 
enzymes arc important in mediating the symptomatology of a number of diseases, 
including rheumatoid arthritis (MuUins, D. E., et al., Biochim Biophys Acta (1983) 
695:117-214); osteoarthritis (Henderson, B., et al.. Drugs of the Future (1990) 15:495- 
508); the metastasis of tumor cells (ibid. Broadhurst, M. J., et al., European Patent 
AppUcation 276,436 (published 1987X Reich. R.. et al., 48 Cancer Res 3307-3312 
(1988); and various ulcerated conditions. Ulcerative conditions can result in the cornea 
as the result of alkali bums or as a result of infection by Pseudomonas aerugin sa, 
Acanthamoeba, Herpes simplex and vacdnia viruses. 

Other conditions characterized by undesired metaUoprotease activity include 
periodontal disease, epidermolysis buUosa and scleritis. In view of the involvement of 
metalloproteases in a number of disease conditions, attempts have been made to prepare 
inhibitors to these enzymes. A number of such inhibitors are disclosed in the Uterature. 
The invention seeks to provide novel inhibitors, preferably specific to this protease, that 
have enhanced activity in treating diseases mediated or modulated by this protease. 

Inhibitors of metalloproteases are usefiil in treating diseases caused, at least in 
part, by breakdowaof structural proteins. A variety of inhibitors have been prepared, but 
there is a continuing need for metaUoprotease inhibitor screens to design drugs for 
treating such diseases. 

MetaUoproteases are a broad class of proteins which have widely varied functions. 
Disintcgrins are zinc metaUoproteases, abundant in snake venom. Alternate cloning 
strategies could be used. MammaUan disintcgrins are a family of proteins with about 1 8 



known subgroups. They act as cell adhesion disrupters and arc also known to be active in 
reproduction (for example, in fertilization of the egg by the sperm, including fusion 
thereof, and in sperm maturation). 

These proteases and many others are uncovered in molecular biology, and 
biochemistry. As a result, Genbank, a repository for gene sequences, provides several 
sequences of metalloproteascs, including some said to encode fragments of disintegrins. 
For example, GenBank accession # Z48444 dated February 25, 1994 discloses 2407 bases 
of a rat gene said to be a rat disintegrin metalloprotease gene; GenBank accession # 
Z48579 dated March 2, 1995 discloses 1824 bases of a partial sequence of a gene said to 
be a human disintegrin metalloprotease gene; GenBank accession # Z21961 dated 
October 25, 1994, discloses 2397 bases of a partial sequence of a gene said to be a bovine 
zinc metalloprotease gene. 

It would be advantageous to implicate metalloproteascs in specific disease states, 
and to use these metalloproteases as tools to detect and ultimately cure, control or design 
cures for such diseases. 

OBJECTS OF THE INVENTTON 

It is an object of the present invention to provide a method for identifying 
compounds capable of binding to the disintegrin protdn. 

It is also an object of the present invention to provide a host cell compri»ng a 
recombinant expression vector to the disintegrin protein and a recombinant expression 
veaor encoding to the disintegrin protein. 

It is also an object of the present invention to provide a method for screening for 
metalloprotease mediated diseases such as cancer, arthropothies (including ankylosing 
spondolytis, rheumatiod arthritis, gouty arthritis (gout), inflanunatoty arthritis, Lyme 
disease and osteoarthrtis). 

It is also an object of the present invention to provide an antibody to the protdn 
useful in the screen, in the isolation of the protein or as a targeting moiety for the protdn. 

SUMMARY OF THE INVENTION 

This invention provides a method for identifying compounds capable of binding to 
the disintegrin protein, and determining the amount and affinity of a compound capable of 
binding to the disintegrin protein in a sample. 

This invention also provides a host cell comprising a recombinant expression 
vector to the disintegrin protein and a recombinant expression vector encoding to the 
disintegrin protein and the human disintegrin metalloprotease protein, fragment or mutant 
thereof^ usefiil for these purposes. 
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This invention also provides an in vivo or in vitro method for screening for 
osteoarthrtis and other metalloprotease based diseases, such as cancer, enable of 
manufacture and use in a kit form. 

DETAILED DESCRIPTION 

As used herein, the teiros "protein," "protease," and "n^talloprotease" refer to a 
disintegrin. Prefe^ly this is a human dismtegrin as described below. 

The term "antibody" refers to an antibody to a disintegrin, or fragment thereof 
These many be monoclonal or polyclonal, and can be from any of several sources. The 
invention also contemplates fragments of these antibodies made by any method in the 
proton or peptide art. . 

The term "disease screen" refers to a screen for a disease or disease state. A 
disease state is the physiological or celluiar or biochemical manifestation of the disease. 
Preferably this screen is used on body tissues or fluids of an animal or cell culture, using 
standard techniques, such as ELISA, It also contemplates "mapping" of disease in a 
whole body, such as by labeled antibody as described above given systemically: regardless 
of the detection method, preferable such detection methods include fluorescence. X-ray 
Cmcluding CAT scan), NMR (Including MRI), and the like. 

The term "compound screen" is related to the methods and screens related to 
finding compounds, determining their affinity for the protease, or designing or selecting 
compounds based on the screen. In another embodiment, it contemplates the use of the 
three dimensional structure for drug design, preferable "rational drug design", as 
understood by the art. It may be preferred that the protease is in "essentially pure form", 
which refers to a protein reasonably free of other impurities, so as to make it useful for 
experiments or characterization. Use of this screening method asasts the skilled artisan in 
finding novel structures, whether made by the chemist or by nature, which bind to and 
preferably inhibit the protease. These "inhibitors" may be useful in regulating or 
modulating the activity of the protease, and may be used to thus modulate the biological 
cascade that they function in. This approach affords new pharmaceutically useful 
compounds. 

The term "disintegrin" refers to a disintegrin, a fragment thereof; a mutant thereof 
or a homologue which still retains its function. This term contemplates aggracanase, and 
other proteases which are involved in or modulate tissue remodelling. This contemplates 
diantegrins from differing species, and those prepared by recombinant methods, in vitro 
methods, or standard peptide synthesis. Preferably the protein is a human disintegrin or 
mutant thereof For the purposes of defining the mutants of the protein the preferred 
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"native" protein is described in GenBank accession » Z48579, incorporated herein by 
reference and referred to in the sequence below. SEQ ID NO 1 describes a fragment of 
that DNA sequence and its transcript and SEQ ID NO 2 describes the coding region of 
the gene and its transcript. Homologue disintegrins include whole proteins with at least 
90% homology as understood by the art, or fragments thereof. For example, a rat 
protein which is 95% homologous to that of SEQ ID NO. 1 or SEQ ID NO. 2 based on 
the peptide sequence derived from the DNA or cDNA sequence, and a bovine protein 
(similariy derived) being 97-98% homologous, are both considered homologues. Thus 
homologous cDNAs cloned from other organisms give rise to homologous proteins. 

Likewise proteins may be considered homologues based on the amino acid 
sequence alone. Practical limitations of amino acid sequencing would allow one to 
determine that a protein is homologous to another using for example comparison of the 
first 50 amino adds of the protein. Hence 90% homokigy in would allow for 5 differing 
amino acids in the chain of the first 50 amino acids of the homologous protein. 

The skilled artisan will appreciate that the degeneracy of the genetic code provides 
for differing DNA sequences to provide the same transcript, and thus the same peptide. In 
certain cases preparing the DNA sequence, which encodes for the same peptide, but 
differs from the native DNA include; 

— ease of sequencing or syntlwas; 

— increased expresaon of the peptide; and 

— preference of certain heterologous hosts for certain codons over others. 

These praaical considerations are widely known and provide onbodiments that 
may be advantageous to the user of the invention. Thus it is deariy contemplated that 
the native DNA is not the only embodiment enviaoned in this invention. 

In addition it is apparent to the skilled artisan that fragments of the protein may be 
used in screening, drug design and the like, and that the entire peptide nuy not be 
required for the purposes of using the invention. Thus it is dearly contonplated that the 
skilled artisan will understand that the disclosure of the peptide and its uses contemplates 
the useful peptide firagments. 

The practical considerations of protein «pression, purification yidd, stability, 
solubility, and the like, are coniddered by the skilled artisan when chooang wh^er to use 
a fragment, and the fragment to be used. As a result, using routine practices in the art, 
the artisan can, given this disclosure practice the invention using Augments of the protdn 
as well. 
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The protein or protease itsdf can be used to determine the binding activity of small 
molecules to the protein. Dnig screening using enzymatic targets is used in the art and 
can be employed using automated, high throughput technologies. 

The inhibition of disintegrin activity may be a predictor of efficacy in the treatment 
of osteoarthritis, and otiier diseases involving degeneration of articular cartilage. 
Gene therapy 

Without being bound by tiieoiy it is thought that the metaUoprotease is up 
regulated during osteoarthritis in tissues. We have surprisingly found that a human 
disintegrin is up-regulated in human chondrocytes during osteoarthritic conditions. 
Inhibition of signal transduction mechanism is efficacious in disrupting tiie cascade of 
events in osteoarthritis and other diseases involving cartilage degeneration. The skUled 
artisan wiU recognize tiwt if up-regulation is a cause of the onset of artiiritis, then 
interfering with the activity of this gene may be usefiil in treating osteoartfmtis. 

This is done by any of several metiiods, inchiding gene (i.e., antisense) tiierapy. 
Inhibitors of the di«int,>grin 

The protease of the invention can be used to find inhibitors of the protease. 
Hence it is useful as a screening tool or for rational drug design. Witiiout being bound by 
tiieoiy, tht protease may modulate ceUular remodeling and in &ct may enhance 
extracellular matrix remodeling and tiius enhance tissue breakdown.. Hence inhibition of 
disintegrin provides a therapeutic route for treatment of diseases characterized by these 
processes. 

In screening, a drug compound can be used to determine both the quality and 
quantity of inhibition. As a result such screening provides information for selection of 
actives, preferably small ntolecule actives, which are useful in treating these diseases. 

In tiierapy, inhibition of disintegrin metaUoprotease activity via binding of small 
molecular weight, synthetic metaUoprotease inhibitors, such as tiiose used to inhibit the 
matrix metaUoproteases would be used to inhilnt extracellular matrix remodeling. 
Antibodies tnthppT^l^n 

MetaUoproteases active at a particularly undesired location (e.g., an organ or 
certain types of ceUs) can be targeted by conjugating a metaUoprotease inhibitor to a to an 
antibody or fragment tiiereof Conjugation methods are known in the art. 

The antibody of the invention can also be conjugated to soUd supports. These 
conjugates can be used as aflSnity reagents for ti»e purification of a desired 
metaUoprotease, preferably a disintegrin. 
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In another aspect, the antibody of the invention is directly conjugated to a label. 
As the antibody binds to the metalloprotease, the label can be used to detect the presence 
of relatively high levels of metalloprotease in vjve or in vitro cell culture. 

In addition, the metalloprotease inhibiting compounds can be conjugated to 
antibodies. Typical conjugation methods are known in the art. These antibodies are then 
useful both in therapy and in monitoring the dosage of the inhibitors. 

For example, targeting ligand which specifically reacts with a marker for the 
intended target tissue can be used. Methods for coupling the invention compound to the 
targeting ligand are weU known and are similar to those described below for coupling to 
carrier. The conjugates are formulated and administered as described above. 
Preparation and Use of Antibodies : 

Antibodies may be made by several methods, for example, the protein may be 
injected into suitable (e.g., mammalian) subjects including mice, rabbits, and the like. 
Preferred protocols involve repeated injection of the immunogen in the presence of 
adjuvants according to a schedule which boosts production of antibodies in the seium. 
The titers of the immune serum can readily be measured using immunoassay procedures, 
now standard in the art. 

The antisera obtained can be used direcUy or monoclonal antiboifies may be 
obtained by harvesting the peripheral blood lymphocytes or the spleen of the immunized 
animal and immortaliang the antibody-produdng cdls, followed by identifying the 
suitable antibody producers using standard immunoassay techniques. 

Polyclonal or monoclonal prq)arations are usefiil m monitoring theii^)y or 
prophylaxis regimens involving the compounds of the invention. Suitable sanq)les such as 
those derived firom blood, serum, urin^ or saliva can be tested for the preseiace of Uie 
protein at various times during the treatment protocol using standard immunoassay 
teduiiques which employ the antibody prq>arations of the invention. 

These antibodies can also be coupled to labels such as sdntigrapluc labels, e.g., 
technetium 99 or 1-131, using standard coupling methods. The labeled compounds are 
administered to subjects to determine the locations of excess amounts of one or more 
metalloproteases in vivo. Hence a labelled antibody to the protdn would operate a a 
screening tool for such enhanced expression, indicating the ifisease. 

The ability of the antibodies to bind metalloprotease selectively is tlms taken 
advantage of to map the distribution of these enzymes in situ. The techniques can also be 
employed in histological procedures and the labeled antibodies can be used in competitive 
immunoassays. 
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Antibodies are advanugeousiy coupled to other compounds or materials usmg 
known methods. For example, materials having a carboxyl fimctionality, the carboxyl 
residue can be reduced to an aldehyde and coupled to carrier through reaction with side 
chain amino groups, optionally followed by reduction of imino linkage fonnesfi. The 
carboxyl residue can also be reacted with side chain amino groups using condensing 
agents such as dicyclohexyl cari)odiimide or other carbodiimide dehydrating agents. 
Linker compounds can also be used to effect the coupling; both homobifiinttional and 
heterobifUnctional linkers are available from Pierce Chemical Company, Rockford. 111. 

These antibodies, when conjugated to a suitable chromatography material are 
useful in isolating the protein. Separation methods using affinity chromatography are 
well known in the art, and are within the purview of the skilled artisan. 

Without being bound by theoiy, expression of genes, and preferably this gene may 
have a restricted tissue distribution and its expression is up regulated by potential 
osteoarthritis mediators. Enhanced expression of this gene (and hence its protein) for 
example, in articular chondrocytes provides a marker to monitor the development, 
including the earliest, asymptomatic stages, and the progression of osteoarthritis. Hence 
an antibody raised to the protein would operate a a screening tool for such enhanced 
expression, indicating the disease. 

In addition, what used in a disease screen, antibodies can be conjugated to 
diromophore or fluorophore containing materials, or can be conjugated to enzymes which 
produce chromophores or fluorophores in certain conditions. These conjugating 
materials and methods arc well known in the art. When used in this manner detection of 
the protein by immunoassay is straightforward to the skilled artisan. Body fluids, for 
example can be screened in this manner for calibration, and detection of distribution of 
metalloproteases, or increased levels of these proteases. 

When used in this w^ the invention is a usefiil diagnostic and/or clinical marker 
for metalloprotease mediated diseases, such as osteoarthritis or other articular cartilage 
degenerative diseases. When disease is detected, it may be treated before the onset of 
symptom or debilitation. 

Furthermore, such antibodies can be used to target diseased tissue, for detection 
or treatment as described above. 

The following non-limiting examples illustrate a preferred embodiment of the 
present invention, and briefly describe the uses of the present invention. These 
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examples are provided for the guidance of the skilled artisan, and do not limit the 
invention in any way. Armed with this disclosure and these examples the skilled 
artisan is capable of making and using the claimed invention. 

Standard starting materiak are used for these examples. Many of these materials 
are known and commercially available. For example E. coU CJ236 and JMlOl are 
known strains, pUBllO is a known plasmid and Kunkd method mutagenesis is also weU 
known in the art. 

Variants may be made by expression systems and by various methods in various 
hosts, these methods are within the scope of the practice of the skilled artisan in 
molecular biology, biochemistiy or other arts related to biotechnolo^. 

Example 1 

RNA was isolated from unstimulated and interleukin-1 stimulated cultures of 
normal human articular chondrocytes. The RNA was reverse transcribed into cDNA 
The cDNA was subjected to a modified differential display procedure using a series of 
random primers. 

PCR samples generated from both stimulated and unstimulated dKNidrocytes were 
electrophoreses in adjacent lanes on polyaciylamide gels. The differentially expressed 
band was excised from the gel, cloned, and sequenced. The differential expression of the 
gene was confirmed by RNase protection and nuclear run on expoimoits. 

Example 2 

A novel partial human cDNA coding the proton is cloned from primary cultures f 
interleukin-1 stimulated human articular (femoral head) chondrocytes, using known 
methods. 

The same sequence is found, and the gene completed by screening of human cDNA 
libraries to obtain full length clones. 

Examplg 3 

The cloned DNA of example 2 was placed in pUB 110 using known methods. 

This plasmid is used to transform R coli and provides a tonplate fijr ate-directed 
mutagenesis to create new mutants. Kunkel method mutagenesis performed altering GLN 
lALA 

Example 4 

[12^1] disintegrin antibody is prepared using lOOOBEADS (Pierce, Rockford, IL; 
immobilized chloramine-T on nonporous polystyrene beads). Lyophilized antibody (2 |ig) 
is taken up in 50 ^1 of 10 mM acetic acid and added to 450 ^t of phosphate-buffered 
saline (PBS) (Sigma, St. Louis, MO) on ice. To the tube is added 500 ^Curie of ^^^I 
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(Amersham, Arlington Heights, IL) (2200Ci/minoI) in 5 and one lODOBEAD. The 
reaction is incubated on tee for 10 min with occasional shaking. The reaction is then 
terminated by removal of the reaction from the lODOBEAD. To remove unreacted 125i^ 
the mixture is applied to a PD-10 gd filtration column. 

Examples 

A fluorogenic peptide (Bachem, Guelph MiUs, King of Prussia. Pa) is mixed with 
the disintegrih and change in the fluorecence is evaluated at 2 min, as a control.. Then 
the .fluorogenic peptide is mixed with the disintegrin in the presence of the compound in 
evaluation in a separate run, with evaluation at 2 minutes. Data are evaluated using 
standard methodology to provide relative binding of tiie evaluated compound. 

Example 6 

0.5ml of synovial fluid from tiie left knee of a patient is withdrawn and tested for 
elevated levels disintegrin by EUSA. The results indicate higher than normal disintegrin 
level. The patient is prescribed a prophylactic dose of a disintegrin inhibitor, and is 
administered an injection of same in the left knee before leaving the clinician's office. 

Example 7 

Inhibition of extraceUular matrix remodeling is explored via inhibition of 
disintegrin metaUoprotease activity. Using a smaU molecular weight, synthetic 
metalloprotease inhibitor, such as Uiose used to inhibit the matrix metalloprotcases, tissue 
int^rety and proteo^ycan is monitored. 

A sample of mouse derived articular cartilage is grown in a I micromolar solution 
of a smaU molecular weight disintegrin inhibitor. The experiment is controlled and 
compared to an identical culture grown witii no inhibitor. 

The assay of the culture after 7 days shows that the inhibited culture has less 
tissue breakdown and less proteoglycan present in the serum of the culture. The result is 
consistant witii the inhHiitedaggrecanase activity. Inhibition ofaggrecanase would inhibit 
tissue breakdown and reduce die release of proteoglycan. 

Example 8 

Inhibition of proteolytic processing resulting in tiie release from tiie membrane 
bound form of ti:e disintegrin metalloprotease domain inhibits "second messenger- 
signaling oftiie membrane bound diantegrin molecule. Such second messenger signaling 
would result in ceUular phenotypic changes, changes in gene expression, changes in 
mitotic activity, and tiie like. 

Cells known to contain disintegrin are treated with a serine protease. Proteins 
released from the cell are measured by standard metiiods! Specifically the 
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metalloprotease activity is monitored via literature methods. The amount of 
metalloprotease released is correlated to the amount of serine protease used to treat the 

cells. 

Increaeses, versus conu-ol, in src tyrosine kinase activity are measured by Western' blot 
analysis of intracellular proteins using monoclonal antibodies specific for phosphotyrosine 
following cleavage and release of the disintegrin metalloprotease. Controls are cells that 
have not been treated with serine protease. 

src tyrosine kinase activity in the cell (or is it cell culture) is measured by literature 
meathods. Release of the metalloprotease domain of the disintegrin is also monitored via 
literature methods. There is a direct correlation between release of the metalloprotease 
domain and increases in intracellular src tyrosine kinase activity. This lesuh is consistent 
with stimulation of diant^n-mediated cell agnalling by stimulation of the src tyrosine 
kinase cascade. 

Eiamnle 9 

Inhibition of intercellular adhesion molecules, or extracellular matrix components 
results in the inhibition of phenotypic changes, including changes in cell shape, associated 
with such interactions, as described in Example 8. 

Integrin binding is measured witii a pq>tide containing the sequence RGD, unng 
the protocol of Example 8. Integrin binding is measured via competitive assay, uang 
cellular changes in shape viable via microscopy. The peptide inhibits the cellular changes 
as in Example 8. 

Tlus result is conastent with competition with or blocking of the interaction of 
disintegrin. The RGD peptide inhibits celluhu* changes in chondrocytes. The 
osteoarthritis phenotype, charactatsed by increased matrix synthesis and accderated 
matrix metalloprotease activity does not occur. Other rea^y assayable cdhilar dianges 
can be used to moniter tMs result, including gene expresaon, changes in mitotic activity, 
and the like. 

Example 10 

A small molecular wdght metalloprotease inhibitor is used to treat a tissue 
culture according to the method of Example 7. The release of TNF-a from the cell 
membrane is measured by literature methods. The inhibitor of Example 7 also decreases 
the amount of TNF-a secreted from the cell membrane. 

This is conastant with the theory that inhibition of disintegrin metalloprotease 
activity will result in the inlubition of a diantegrin associated inflammation cascade and 
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secretase activity. It is contemplated that monitoring the release of cytokines or IL-1 
from the cell membrane, and the like will produce the same result. 

All references described herdn ve hereby incorporated by reference. 

While particular embodiments of the subject invention have been described, it will 
be obvious to those skUled in the art that various changes and modifications of the subject 
invention can be made without departing from the spirit and scope of the invention. It is 
intended to cover, in the appended claims, all such modifications that are within the scope 
of this invention. 
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(E) COUNTRY: USA 

(F) ZIP: 45040-9462 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC conpatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentin Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: - 

(A) APPLICATION NUMBER: US 60/012,679 

(B) FILING DATE: Ol-MAR-1996 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: HAKE, RICHARD A. 

(B) REGISTRATION NUMBER: 37,343 

(C) REFERENCE/DOCKET NUMBER: 5980 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 513/622-0087 

(B) TELEFAX: 513/622-0270 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1961 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2.. 1474 



(xi) SEQUENCE DESCRIPTION: SEQ ID H0:1: 
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^ t^^ t?^ CGT AAC ATC ACT TTC ATG GTG 

Gin Thr Thr Asp Phe Ser Gly He Arg Asn He Ser Phe Met Val 
1 5 10 . 15 

AAA CGC ATA AGA ATC AAT ACA ACT GCT GAT GAG AAG GAG CCT ACA AAT 
Lys Arg lie Arg lie Asn Thr Thr Ala Asp Glu Lys Asp Pro tS 

CCT TTC CGT TTC CCA AAT ATT ACT CTG GAG AAG TTT CTG GAA TTG AAT 
Pro Phe Arg Phe Pro Asn He Ser Val Glu Lys Phe Leu Glu Leu Asn 
35 40 45 

l?r T''^ ^'^ "^^^ ^'^T <=TC TTC ACA GAC 

Ser Glu Gin Asn Has Asp Asp Tyr Cys Leu Ala Tyr Val Phe Thr Asp 
50 55 



60 



220 



46 



94 



142 



190 



238 



382 



CGA GAT TTT GAT GAT GGC CTA CTT GCT CTG GCT TGG GTT GGA GCA CCT 
Arg Asp Phe Asp Asp Gly Val Leu Gly Leu Ala Trp Val Gly Ala Pro 
6= 70 75 

TCA GGA AGC TCT GGA GGA ATA TCT GAA AAA ACT AAA CTC TAT TCA GAT 286 
ser Gly Ser Ser Gly Gly He Cys Glu Lys Ser Lys Leu Tyr Ser Asp 

85 90 95 

GGT AAG AAG AAG TCC TTA AAC ACT GGA ATT ATT ACT CTT CAG AAC TAT 334 
Gly Lys Lys Lys Ser Leu Asn Thr Gly He He Thr Val Gin Aan Tyr 
100 105 110 

GGG TCT CAT CTA CCT CCC AAA CTC TCT CAG ATT ACT TTT GCT CAC GAA 
Gly Ser Hxs Val Pro Pro Lys Val Ser His He Thr Phe Ala His Glu 
115 120 125 

CTT GGA CAT AAC TTT GGA TCC CCA CAT GAT TCT GGA ACA GAG TGC ACA 430 
Val Gly His Asn Phe Gly Ser Pro His Asp Ser Gly Thr Glu Cys Thr 
130 135 HQ 

CCA GGA GAA TCT AAG AAT TTG GGT CAA AAA GAA AAT GGC AAT TAC ATC 478 
Pro Gly Glu Ser Lys Asn Leu Gly Gin Lys Glu Asn Gly Asn Tyr He 
1*= 150 155 

ATG TAT GCA AGA GCA ACA TCT GGG GAC AAA CTT AAC AAC AAT AAA TTC 526 
Met Tyr Ala Arg Ala Thr Ser Gly Asp Lys Leu Asn Asn Asn Lys Phe 
160 165 170 175 

TCA CTC TGT ACT ATT AGA AAT ATA AGC CAA CTT CTT GAG AAG AAG AGA 574 
Ser Leu Cys Ser He Arg Asn He Ser Gin Val Leu Glu Lys Lys Ara 
180 185 ^ 190 

AAC AAC TCT TTT- CTT GAA TCT GGC CAA CCT ATT TGT GGA AAT GGA ATG 
Asn Asn Cys Phe Val Glu Ser Gly Gin Pro He Cys Gly Asn Gly Met 
195^ 200 205 

GTA GAA CAA GCT GAA GAA TCT GAT TGT GGC TAT AGT GAC CAG TCT AAA 
Val Glu Gin Gly Glu Glu Cys Asp Cys Gly Tyr Ser Asp Gin Cys Lys 
210 215 Jr 



622 



670 



GAT GAA TGC TGC TTC GAT GCA AAT CAA CCA GAG GGA AGA AAA TGC AAA 718 
Asp Glu Cys Cys Phe Asp Ala Asn Gin Pro Glu Gly Arg Lys Cys Lys 
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225 230 235 

CTG AAA CCT GGG AAA CAG TGC AGT CCA AGT CAA GGT CCT TGT TGT ACA 766 
Leu Lys Pro Gly Lys Gin Cys Ser Pro Scr Gin Gly Pro Cys Cys Thr 
240 245 250 255 

GCA CAG TGT GCA TTC AAG TCA AAG TCT GAG AAG TGT CGG GAT GAT TCA ' 814 
Ala Gin Cys Ala Phe Lys Ser Lys Ser Glu Lys Cys Arg Asp Asp Ser 
260 265 270 

GAC TGT GCA AGG GAA GGA ATA TGT AAT GGC TTC ACA GCT CTC TGC CCA 862 
Asp Cys Ala Arg Glu Gly lie Cys Asn Gly Phe Thr Ala Leu Cys Pro 
275 280 285 

GCA TCT GAC CCT AAA CCA AAC TTC ACA GAC TGT AAT AGG CAT ACA CAA 910 
Ala Ser Asp Pro Lys Pro Asn Phe Thr Asp Cys Asn Arg His Thr Gin 
290 295 300 

GTG TGC ATT AAT GGG CAA TGT GCA GGT TCT ATC TGT GAG AAA TAT GGC 958 
Val Cys lie Asn Gly Gin Cys Ala Gly Ser lie Cys Glu Lys Tyr Gly 
305 310 315 

TTA GAG GAG TGT ACG TGT GCC AGT TCT GAT GGC AAA GAT GAT AAA GAA 1006 
Leu Glu Glu Cys Thr Cys Ala Ser Ser Asp Gly Lys Asp Asp Lys Glu 
320 325 330 335 

TTA TGC CAT GTA TGC TGT ATG AAG AAA ATG GAC CCA TCA ACT TGT GCC 1054 
Leu Cys His Val Cys Cys Met Lys Lys Met Asp Pro Ser Thr Cys Ala 
340 345 350 

AGT ACA GGG TCT GTG CAG TGG AGT AGG CAC TTC AGT GGT CGA ACC ATC 1102 
Ser Thr Gly Ser Val Gin Trp Ser Arg His Phe Ser Gly Arg Thr lie 
355 360 365 

ACC CTG CAA CCT GGA TCC CCT TGC AAC GAT TTT AGA GGT TAC TGT GAT 1150 
Thr Leu Gin Pro Gly Ser Pro Cys Asn Asp Phe Arg Gly Tyr Cys Asp 
370 375 380 

GTT TTC ATG CGG TGC AGA TTA GTA GAT GCT GAT GGT CCT CTA GCT AGG 1198 
Val Phe Met Arg Cys Arg Leu Val Asp Ala Asp Gly Pro Leu Ala Arg 

385 390 395 

CTT AAA AAA GCA ATT TTT AGT CCA GAG CTC TAT GAA AAC ATT GCT GAA 1246 
Leu Lys Lys Ala lie Phe Ser Pro Glu Leu Tyr Glu Asn lie Ala Glu 
400 405 410 415 

TGG ATT GTG GCT CAT TGG TGG GCA GTA TTA CTT ATG GGA ATT GCT CTG 1294 
Trp He Val Ala His Trp Trp Ala Val Leu Leu Met Gly He Ala Leu 
420 425 430 

ATC ATG CTA ATG GCT GGA TTT ATT AAG ATA TGC AGT GTT CAT ACT CCA 1342 
He Met Leu Met Ala Gly Phe He Lys He Cys Ser Val His Thr Pro 
435 440 445 

AGT AGT AAT CCA AAG TTG CCT CCT CCT AAA CCA CTT CCA GGC ACT TTA 1390 
Ser Ser Asn Pro Lys Leu Pro Pro Pro Lys Pro Leu Pro Gly Thr Leu 
450 455 460 



AAG AGG AGG AGA CCT CCA CAG CCC ATT CAG CAA CCC CAG CGT CAG CGG 



1438 
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Lys Arg Arg Arg Pro Pro Gin Pro lie Gin Gin Pro Gin Arg Gin Arg 
465 470 475 

CCC CGA GAG AGT TAT CAA ATG GGA CAC ATG AGA CGC TAACTGCAGC 1484 
Pro Arg Glu Ser Tyr Gin Met Gly His Met Arg Arg 
480 485 490 

TTTTGCCTTG GTTCTTCCTA GTGCCTACAA TGGGAAAACT TCACTCCAAA GAGAAACCTA 1544 

TTAAGTCATC ATCTCCAAAC TAAACCCTCA CAAGTAACAG TTGAAGAAAA AATGGCAAGA 1604 

GATCATATCC TCAGACCAGG TGGAATTACT TAAATTTTAA AGCCTGAAAA TTCCAATTTG 1664 

GGGGTGGGAG GTGGAAAAGG AACCCAATTT TCTTATGAAC AGATATTTTT AACTTAATGG 1724 

CACAAAGTCT TAGAATATTA TTATGTGCCC CGTGTTCCCT GTTCTTCGTT GCTGCATTTT 1784 

CTTCACTTGC AGGCAAACTT GGCTCTCAAT AAACTTTTCG GTCCAGACCA CAGACTTCTC 1844 

CGGAATCCGT AACATCA6TT TCATGGTGAA ACGCATAAGA ATCAATACAA CTGCTGATGA 1904 

GAAGGACCCT ACAAATCCTT TCCGTTTCCC AAATATTAGT GTGGAGAAGT TAAACAA 1961 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 491 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID N0:2: 

Gin Thr Thr Asp Phe Ser Gly He Arg Asn He Ser Phe Met Val Lys 
IS 10 15 

Arg lie Arg He Asn Thr Thr Ala Asp Glu Lys Asp Pro Thr Asn Pro 
20 25 30 

Phe Arg Phe Pro Asn He Ser Val Glu Lys Phe Leu Glu Leu Asn Ser 
35 40 45 

Glu Gin Asn His Asp Asp Tyr Cys Leu Ala Tyr Val Phe Thr Asp Arg 
50 55 60 

Asp Phe Asp Asp Gly Val Leu Gly Leu Ala Trp Val Gly Ala Pro Ser 
€5 70 75 80 

Gly Ser Ser Gly Gly He Cys Glu Lys Ser Lys Leu Tyr Ser Asp Gly 
85 90 95 

Lys Lys Lys Ser Leu Asn Thr Gly He He Thr Val Gin Asn Tyr Gly 
100 105 110 

Ser His Val Pro Pro Lys Val Ser His He Thr Phe Ala His Glu Val 

U5 120 125 



wo 97/31931 
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Gly His Asn Phe Gly Ser Pro His Asp Ser Gly Thr Glu Cys Thr Pro 

135 140 

Gly Glu Ser Lys Asn Leu Gly Gin Lys Glu Asn Gly Asn Tyr He Met 

"0 155 

Tyr Ala Arg Ala Thr Ser Gly Asp Lys Leu Asn Asn Asn Lys Phe Ser 
165 170 175 

Leu Cys Ser He Arg Asn He Ser Gin Val Leu Glu Lys Lys Arg Asn 

165 190 

Asn Cys Phe Val Glu Ser Gly Gin Pro He Cys Gly Asn Gly Met Val 

200 205 

Glu Gin Gly Glu Glu Cys Asp Cys Gly Tyr Ser Asp Gin Cys Lys Asp 



215 



220 



Glu Cys Cys Phe Asp Ala Asn Gin Pro Glu Gly Arg Lys Cys Lys Leu 



230 



235 



240 



Lys Pro Gly Lys Gin Cys Ser Pro Ser Gin Gly Pro Cys Cys Thr Ala 
245 250 255 

Gin Cys Ala Phe Lys Ser Lys Ser Glu Lys Cys Arg Asp Asp Ser Asp 
260 265 270 

Cys Ala Arg Glu Gly He Cys Asn Gly Phe Thr Ala Leu Cys Pro Ala 
275 280 285 

Ser Asp Pro Lys Pro Asn Phe Thr Asp Cys Asn Arg His Thr Gin Val 
290 295 300 

Cys He Asn Gly Gin Cys Ala Gly Ser He Cys Glu Lys Tyr Gly Leu 
305 310 315 320 

Glu Glu Cys Thr Cys Ala Ser Ser Asp Gly Lys Asp Asp Lys Glu Leu 
325 330 335 

Cys His Val Cys Cys Met Lys Lys Met Asp Pro Ser Thr Cys Ala Ser 
340 345 350 

Thr Gly Ser Val Gin Trp Ser Arg His Phe Ser Gly Arg Thr He Thr 
355 360 365 

Leu Gin Pro Gly Ser Pro Cys Asn Asp Phe Arg Gly Tyr Cys Asp Val 
370 375 380 

Phe Met Arg Cys Arg Leu Val Asp Ala Asp Gly Pro Leu Ala Arg Leu 
385 390 395 400 

Lys Lys Ala He Phe Ser Pro Glu Leu Tyr Glu Asn He Ala Glu Trp 
405 410 415 

He Val Ala His Trp Trp Ala Val Leu Leu Met Gly He Ala Leu He 
420 425 430 

Met Leu Met Ala Gly Phe He Lys He Cys Ser Val His Thr Pro Ser 
435 440 445 
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Ser Asn Pro Lys Leu Pro Pro Pro Lys Pro Leu Pro Gly Thr Leu Lys 

455 

Arg Arg Arg Pro Pro Gin Pro He Gin Gin Pro Gin Arg Gin Arg Pro 

475 

Arg Glu Ser Tyr Gin Met Gly His Met Arg Arg 
485 

(2) INFORMATION FOR SEQ ID NO: 3: 

. (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2763 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 17., 2414 



97 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:3: 

G6CGGCGGCA CGGAAG ATG GTG TTG CTG AGA GTG TTA ATT CTG CTC CTC 49 

Met Val Leu Leu Arg Val Leu He Leu Leu Leu 
495 500 

TCC TGG GCG GCG GGG ATG GGA GGT CAG TAT GGG AAT CCT TTA AAT AAA 
ser Trp Ala Ala Gly Met Gly Gly Gin Tyr Gly ITo JJn 

510 515 

Ivl ill ^ ^'^^ ^"^^^ TCA TTA CAC 145 

^ lit ^ ^^"^ Asn Val Asp Ser Leu His 

^20 525 

CAA AAA CAC CAG CGT GCC AAA AGA GCA GTC TCA CAT GAA GAC CAA TTT 193 
Gin Lys His Gin Arg Ala Lys Arg Ala Val Ser His Glu Asp Sn pJI 

540 545 

TTA CGT CTA GAT TTC CAT GCC CAT GGA AGA CAT TTC AAC CTA CGA ATG 
Leu Arg Leu Asp Phe.His Ala His Gly Arg His pSe ^n Zu 51? 

555 560 565 

AAG AGG GAC ACT TCC CTT TTC AGT GAT GAA TTT AAA GTA GAA ACA TCA 289 
Lys Arg Asp Thr Ser Leu Phe Ser Asp Glu Phe Lys Val Glu Thr Ser 
570 575 580 

AAT AAA GTA CTT GAT TAT GAT ACC TCT CAT ATT TAC ACT GGA CAT ATT 337 
Asn Lys Val Leu Asp Tyr Asp Thr Ser His He Tyr Thr Gly His lie 
585 590 



595 



Ivl 'Sl'^'^n'f^ IT "^^^ ATT GAT GGA AGA 385 

Tyr Gly Glu Glu Gly Ser Phe Ser His Gly Ser Val He Asp Gly Arg 
600 605 610 
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TTT GAA GGA TTC ATC CAG ACT CGT GGT GGC ACA TTT TAT GTT GAG CCA 433 
Phe Glu Gly Phe He Gin Thr Arg Gly Gly Thr Phe Tyr Val Glu Pro 
615 620 625 630 

GCA GAG AGA TAT ATT AAA GAC CGA ACT CTG CCA TTT CAC TCT GTC ATT ^ 481 
Ala Glu Arg Tyr lie Lys Asp Arg Thr Leu Pro Phe His Ser Val He 
635 640 645 

TAT CAT GAA GAT GAT ATT AGT GAA AGG CTT AAA CTG AGG CTT AGA AAA 529 
Tyr His Glu Asp Asp He Ser Glu Arg Leu Lys Leu Arg Leu Arg Lys 
650 655 660 

CTT ATG TCA CTT GAG TTG TGG ACC TCC TGT TGT TTA CCC TGT GCT CTT 577 
Leu Met Ser Leu Glu Leu Trp Thr Ser Cys Cys Leu Pro Cys Ala Leu 
665 670 675 

CTG CTT CAC TCA TGG AAG AAA GCT 6TA AAT TCT CAC TGC CTT TAC TTC 625 
Leu Leu His Ser Trp Lys Lys Ala Val Asn Ser His Cys Leu Tyr Phe 
680 685 690 

AAG GAT TTC TGG GGC TTT TCT GAA ATC TAC TAT CCC CAT AAA TAC GGT 673 
Lys Asp Phe Trp Gly Phe Ser Glu He Tyr Tyr Pro His Lys Tyr Gly 
695 700 705 710 

CCT CAG GGC GGC TGT GCA GAT CAT TCA GTA TTT GAA AGA ATG AGG AAA 721 
Pro Gin Gly Gly Cys Ala Asp His Ser Val Phe Glu Arg Net Arg Lys 
715 720 725 

TAC CAG ATG ACT GGT GTA GAG GAA GTA ACA CAG ATA CCT CAA GAA GAA 769 
Tyr Gin Met Thr Gly Val Glu Glu Val Thr Gin He Pro Gin Glu Glu 
730 735 740 

CAT GCT GCT AAT GGT CCA GAA CTT CTG AGG AAA AGA CGT ACA ACT TCA 817 
His Ala Ala Asn Gly Pro Glu Leu Leu Arg Lys Arg Arg Thr Thr Ser 
745 750 755 

GCT GAA AAA AAT ACT TGT CAG CTT TAT ATT CAG ACT GAT CAT TTG TTC 865 
Ala Glu Lys Asn Thr Cys Gin Leu Tyr He Gin Thr Asp His Leu Phe 
760 765 770 

TTT AAA TAT TAC GGA ACA CGA GAA GCT GTG ATT GCC CAG ATA TCC AGT 913 
Phe Lys Tyr Tyr Gly Thr Arg Glu Ala Val He Ala Gin He Ser Ser 

775 780 785 790 

CAT GTT AAA GCG ATT GAT ACA ATT TAC CAG ACC ACA GAC TTC TCC GGA 961 
His Val Lys Ala He Asp Thr He Tyr Gin Thr Thr Asp Phe Ser Gly 
795 800 805 

ATC CGT AAC ATC AGT TTC ATG GTG AAA CGC ATA AGA ATC AAT ACA ACT 1009 
He Arg Asn He .Ser Phe Met Val Lys Arg He Arg He Asn Thr Thr 
810 815 820 

GCT GAT GAG AAG GAC CCT ACA AAT CCT TTC CGT TTC CCA AAT ATT- AGT 1057 
Ala Asp Glu Lys Asp Pro Thr Asn Pro Phe Arg Phe Pro Asn He Ser 
825 830 B35 

GTG GAG AAG TTT CTG GAA TTG AAT TCT GAG CAG AAT CAT GAT GAC TAC 1105 
Val Glu Lys Phe Leu Glu Leu Asn Ser Glu Gin Asn His Asp Asp Tyr 
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840 845 850 

T6T TTG GCC TAT GTC TTC ACA GAC CGA GAT TTT GAT GAT GGC GTA CTT 1153 
Cys Leu Ala Tyr Val Phe Thr Asp Arg Asp Phe Asp Asp Gly Val Leu 
855 860 865 



870 

GGT CTG GCT TGG 6TT GGA GCA CCT TCA GGA AGC TCT GGA GGA ATA TGT ' 1201 
Gly Leu Ala Trp Val Gly Ala Pro Ser Gly Ser Ser Gly Gly lie Cys 
875 880 885 

GAA AAA AGT AAA CTG TAT TCA GAT GGT AAG AAG AAG TCC TTA AAC ACT 1249 
Glu Lys Ser Lys Leu Tyr Ser Asp Gly Lys Lya Lys Ser Leu Asn Thr 
890 895 900 

GGA ATT ATT ACT GTT CAG AAC TAT GGG TCT CAT GTA CCT CCC AAA GTC 1297 
Gly lie lie Thr Val Gin Asn Tyr Gly Ser His Val Pro Pro Lys Val 
905 910 915 

TCT CAC ATT ACT TTT GCT CAC GAA GTT GGA CAT AAC TTT GGA TCC CCA 1345 
Ser His lie Thr Phe Ala His Glu Val Gly His Asn Phe Gly Ser Pro 
920 925 930 

CAT GAT TCT GGA ACA GAG TGC ACA CCA GGA GAA TCT AAG AAT TTG GGT 1393 
His Asp Ser Gly Thr Glu Cys Thr Pro Gly Glu Ser Lys Asn Leu Glv 
935 940 945 950 

CAA AAA GAA AAT GGC AAT TAC ATC ATG TAT GCA AGA GCA ACA TCT GGG 1441 
Gin Lys Glu Asn Gly Asn Tyr He Met Tyr Ala Arg Ala Thr Ser Gly 
955 960 965 

GAC AAA CTT AAC AAC AAT AAA TTC TCA CTC TGT AGT ATT AGA AAT ATA 1489 
Asp Lys Leu Asn Asn Asn Lys Phe Ser Leu Cys Ser He Arg Asn He 
970 975 980 

AGC CAA GTT CTT GAG AAG AAG AGA AAC AAC TGT TTT GTT GAA TCT GGC 1537 
Ser Gin Val Leu Glu Lys Lys Arg Asn Asn Cys Phe Val Glu Ser Gly 
985 990 995 

CAA CCT ATT TGT GGA AAT GGA ATG GTA GAA CAA GGT GAA GAA TGT GAT 1585 
Gin Pro He Cys Gly Asn Gly Met Val Glu Gin Gly Glu Glu Cys Asp 
1000 1005 1010 

TGT GGC TAT AGT GAC CAG TGT AAA GAT GAA TGC TGC TTC GAT GCA AAT 1633 
Cys Gly Tyr Ser Asp Gin Cys Lys Asp Glu Cys Cys Phe Asp Ala Asn 
1015 1020 1025 1030 

CAA CCA GAG GGA AGA AAA TGC AAA CTG AAA CCT GGG AAA CAG TGC AGT 1681 
Gin Pro Glu Gly Arg Lys Cys Lys Leu Lys Pro Gly Lys Gin Cys Ser 
. 1035 1040 1045 

CCA AGT CAA GGT CCT TGT TGT ACA GCA CAG TGT GCA TTC AAG TCA AAG 1729 
Pro Ser Gin Gly Pro Cys Cys Thr Ala Gin Cys Ala Phe Lys Ser Lys 
1050 1055 1060 

TCT GAG AAG TGT CGG GAT GAT TCA GAC TGT GCA AGG GAA GGA ATA TGT 1777 
Ser Glu Lys Cys Arg Asp Asp Ser Asp Cys Ala Arg Glu Gly He Cys 
1065 1070 1075 



AAT GGC TTC ACA GCT CTC TGC CCA GCA TCT GAC CCT AAA CCA AAC TTC 1825 
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Asn Gly Phe Thr Ala Leu Cys Pro Ala Ser Asp Pro Lys Pro Asn Phe 
1080 1085 1090 

ACA GAC TGT AAT AGG CAT ACA CAA GTG TGC ATT AAT GGG CAA TGT GCA 
Thr Asp Cys Asn Arg His Thr Gin Val Cys He Asn Gly Gin Cvs Ala 
1095 1100 1105 1110 

GGT TCT ATC TGT GAG AAA TAT GGC TTA GAG GAG TGT ACG TGT GCC AGT 
Gly Ser He Cys Glu Lys Tyr Gly Leu Glu Glu Cys Thr Cys Ala Ser 
1115 1120 1125 

TCT GAT GGC AAA GAT GAT AAA GAA TTA TGC CAT GTA TGC TGT ATG AAG 
Ser Asp Gly Lys Asp Asp Lys Glu Leu Cys His Val Cys Cys Met Lys 
1130 1135 1140 

AAA ATG GAC CCA TCA ACT TGT GCC AGT ACA GGG TCT GTG CAG TGG AGT 
Lys Met Asp Pro Ser Thr Cys Ala Ser Thr Gly Ser Val Gin Trp Ser 
1145 1150 1155 

AGG CAC TTC AGT GGT CGA ACC ATC ACC CTG CAA OCT GGA TCC CCT TGC 
Arg His Phe Ser Gly Arg Thr He Thr Leu Gin Pro Gly Ser Pro Cys 
1160 1165 1170 

AAC GAT TTT AGA GGT TAC TGT GAT GTT TTC ATG CGG TGC AGA TTA GTA 
Asn Asp Phe Arg Gly Tyr Cys Asp Val Phe Met Arg Cys Arg Leu Val 
11"^5 1180 1185 1190 

GAT GCT GAT GGT CCT CTA GCT AGG CTT AAA AAA GCA ATT TTT AGT CCA 
Asp Ala Asp Gly Pro Leu Ala Arg Leu Lys Lys Ala He Phe Ser Pro 
1195 1200 1205 

GAG CTC TAT GAA AAC ATT GCT GAA TGG ATT GTG GCT CAT TGG TGG GCA 
Glu Leu Tyr Glu Asn He Ala Glu Trp He Val Ala His Trp Trp Ala 
1210 1215 1220 

GTA TTA CTT ATG GGA ATT GCT CTG ATC ATG CTA ATG GCT GGA TTT ATT 
Val Leu Leu Met Gly He Ala Leu He Met Leu Met Ala Gly Phe He 
1225 1230 1235 

AAG ATA TGC AGT GTT CAT ACT CCA AGT AGT AAT CCA AAG TTG CCT CCT 
Lys He Cys Ser Val His Thr Pro Ser Ser Asn Pro Lys Leu Pro Pro 
1240 1245 1250 

CCT AAA CCA CTT CCA GGC ACT TTA AAG AGG AGG AGA CCT CCA CAG CCC 
Pro Lys Pro Leu Pro Gly Thr Leu Lys Arg Arg Arg Pro Pro Gin Pro 
1255 1260 1265 1270 

ATT CAG CAA CCC CAG CGT CAG CGG CCC CGA GAG AGT TAT CAA ATG GGA 
He Gin Gin Pro Gin Arg Gin Arg Pro Arg Glu Ser Tyr Gin Met Gly 
1275 1280 1285 

CAC ATG AGA CGC T AACTGCAGCT TTTGCCTTGG TTCTTCCTAG TGCCTACAAT 
His Met Arg Arg 
1290 

GGGAAAACTT CACTCCAAAG AGAAACCTAT TAAGTCATCA TCTCCAAACT AAACCCTCAC 
AAGTAACAGT TGAAGAAAAA ATGGCAAGAG ATCATATCCT CAGACCAGGT GGAATTACTT 



1873 

1921 

1969 

2017 

2065 

2113 

2161 

2209 

2257 

2305 

2353 

2401 

2454 

2514 
2574 
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AAATTTPTAAA. CSC:C*rGAAAA.T TddKJKT'T'rGG GCSGtT GCSCLA.GG TGSAAAAAOGA. A.CCCAJVrTYT 2 €3 4 

CXTA.T'CSAA.CA C&A.T^TTYT'r/V A.CTTAA.*rO<SC ACAAAGTC'TT' AOAAJrArrTAT 7A,T<STCSCCCC 2 69 4 

CSTCSTTCCCrrC TTCTTCGTXO 77CA.CT'T'GCJV GOOAAACTT'G GCTCTCAATA 



(2> ZNS*ORK/V,7ZOM POR SEO XD NO s 4 s 

Ci) sjsoueimce: CK^vz%A.c-rE:i%zsTzcs s 

1^) X.Ea«C57Hi -799 &nxd.no Ae^c±s 
<B) TYPE 2 AmJ.no acd.cl 
(O) TOPOLOGY! Xd.nea£^ 

(dL±.) MOX^&CUX^E: TYPBs i>sot^ei.n 

(3Cd.> SKQUENCE DESCRZP7ZONt SBQ ZD NO S 4 s 

VAX Z«s\i g V»X X«eu ZXtt Z.«u Z.au X««u Ses Ai«h G3.y 

3. 5 XO XS 

<Siy GXy GXn rpyxr GXy E'jto X.ou .A^n r.vs tTyt ZXe JVxr9 HXa 

20 ZB 30 

Giy X*«avi Sef Tyt Ajsn VaI A^p Sot X-eu HJ.» GXn X«vs Mis GXn 
3S 40 4S 

r*y» -A^V Aiai Soar HXs GXu Aap GXn PHo Levi Asp Pt»e 

SO SS 60 

AXa Hd.a OXy Axrg MA» Phts A^n X«4bu l><«e X.ys J<Vf9 Asp Tiic 9^k 

"70 7S ao 

P»»« S«f -Asp GXu Pk\e Z-ys VaX GXu Thx- Smx Asn X.ys VaX X««u Aj»p 
es 90 95 

•ryjr Asp TftKf S«jr HJ.s XX« Ty« Tlrtjc GXy HXs ZXe Ty« GXy GXu GXu GXy 
XOO XOS ixo 

S«t Ptett SejT HXa GXy Sef VaX XX« Asp GXy Ajc^ PA\« GXvt GXy Phe IX« 
XXS X20 X25 

^^n- XfciJT A^9 GXy GXy TKf Phe Ty*r VaX GXu Paro AXa GXu A^^a Ty^ XXe 
X30 X3S X40 

Z.ys Asp Af 9 Ths X.ttu Pjre Phe HXs S«£ VaX XXe nrys HXs GXu Asp Asp 

XSO XSS X60 

ZX«i Sejc GXiJi Ajr9 Z.4BU Lya Z««ku Aarg Xiciu Ajcgr X.ys X.<9u Mw'b Sejc Z«ou GXu 
X6S X70 X-TS 

X.eu Trp Thit Ses Cys Cys Lovi PfO Cys AXa X««ui X.AU Z««Vt HXs SOJS 
X80 XeS XSO 

J*ys X.ys AXa VaX Asn HXs Cys X.«u Xyir PHo X.ya Asp PK« Tsp GXy 



27S4 
2763 
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AAATTTTAAA GCC7GAAAAT TCCAAXTTGG GGGTGGGAGG TGGAAAAGGA ACCCAATTTT 2 63 4 

CTTPATGAACA GAT-MPTTTTA A.Crr7AArrGGC ACAAAGTCTT AGAATA.'XrrJVXT TATGTGCCCC 2 694 

GTGTTPCCCTG TTCTTOSXTG CTGCATTXTC XXCACTTGCA GGCAAACTTG GCTCTCAArrA 27 54 

AACITXTCG '* 2 7 6 3 

f2> ZMFOmCATZOM rOR SBQ XD NO s 4 : 

<X) SSOUBNCB CHARACXBRZSTXCS X 

CA) X*BNOTMs 7 99 wnXno agJL^s 
(B) TVPEx AmXno meXci 
(D> TOPOLOGY s XXneajc 

(Xi) MOLBCULE XYPEs piro^oXn 

<acX> SEQUENCE DESCR.Z PXXON s SEQ XD NO ; 4 y 

Me-b V&X Lou L4IU AJrg VaX X.«u XX e X.*\» Lau X.eu S«x; Xsp AO.* AX a GXy 
a- S XO XS 

GXy GXy GXn Xy r GXy Asn Pfo Lou. A^n Lys Xyjr ZXe A^a HXs Xy*r 
20 25 30 

GXu GXy Lau S«f Xy« Asn VaX Asp S«j; X.«u MXs GXn Lys HXs GXn Ajto 
33 40 45 * 

AO.* ^SO '^^^ AXa VaX S«»v HXs GXu Asp GXn PHo L«u Af9 Z^u Aap Ptoa 
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HXs AXa HXs OXy A^9 His Ptm« Asn L«u A^9 >€«e Lys Af9 Asp VtxK S«e 

70 75 BO 

L«u PHa Sas Adp GXu PK« Lys VsX GXu TKf S«v Ausn Lys VmX L«u A^p 

85 90 95 

Xyar Asp XHt Sosr HXs IX«s Xyr Xhxr GXy HXs ZXa Xy« GXy GXu GXu GXy 

XOO X05 XXO 

Smx Pl^c» Soar HXs GXy Soar VmX IXo A^p GXy Aarg Ph»o GXu GXy Pho IXo 

XXS X20 X2S 

GXn XKc AJC9 GXy GXy XKa; PHo Xyar VaX GXu Px^o AXs GXu Ajc9 Xy^ XXo 

X30 X35 X40 

Lys As-» Ajcrr Xhf Lou Ps-o Pl-ko W4 ■« - - — - .n "w - v - , 



22 



Phe Ser Glu He Tyr Tyr Pro His Lys Tyr Gly Pro Gin Gly Gly Cys 



215 



220 



Ala Asp His Ser Val Phe Glu Arg Met Arg Lys Tyr Gin Met Thr Gly 

225 "JOA ^0^m ^ 



230 



235 



240 



Val Glu Glu Val Thr Gin He Pro Gin Glu Glu His Ala Ala Asn 



245 



250 



255 



Gly 



Pro Glu Leu Leu Arg Lys Arg Arg Thr Thr Ser Ala Glu Lys Asn Thr 
260 265 270 

Cys Gin Leu Tyr He Gin Thr Asp His Leu Phe Phe Lys Tyr Tyr Gly 



280 



285 



Thr Arg Glu Ala Val He Ala Gin He Ser Ser His Val Lys Ala He 
290 295 300 

Asp Thr He Tyr Gin Thr Thr Asp Phe Ser Gly He Arg Asn He Ser 
305 310 315 320 

Phe Met Val Lys Arg He Arg He Asn Thr Thr Ala Asp Glu Lys Asp 
325 330 335 

Pro Thr Asn Pro Phe Arg Phe Pro Asn He Ser Val Glu Lys Phe Leu 
340 345 350 

Glu Leu Asn Ser Glu Gin Asn His Asp Asp Tyr Cys Leu Ala Tyr Val 
355 360 365 

Phe Thr Asp Arg Asp Phe Asp Asp Gly Val Leu Gly Leu Ala Trp Val 
370 375 380 

Gly Ala Pro Ser Gly Ser Ser Gly Gly He Cys Glu Lys Ser Lys Leu 
385 390 395 400 

Tyr Ser Asp Gly Lys Lys Lys Ser Leu Asn Thr Gly He He Thr Val 
405 410 415 

Gin Asn Tyr Gly Ser His Val Pro Pro Lys Val Ser His He Thr Phe 
420 425 430 

Ala His Glu Val Gly His Asn Phe Gly Ser Pro His Asp Ser Gly Thr 
435 440 445 

Glu Cys Thr Pro Gly Glu Ser Lys Asn Leu Gly Gin Lys Glu Asn Gly 
450 455 460 

Asn Tyr He Met Ty« Ala Arg Ala Thr Ser Gly Asp Lys Leu Asn Asn 
465 470 475 480 

Asn Lys Phe Ser Leu Cys Ser He Arg Asn He Ser Gin Val Leu Glu 
485 490 495 

Lys Lys Arg Asn Asn Cys Phe Val Glu Ser Gly Gin Pro He Cys Gly 
500 505 510 

Asn Gly Met Val Glu Gin Gly Glu Glu Cys Asp Cys Gly Tyr Ser Asp 
515 520 525 
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Gin Cys Lys Asp Glu Cys Cys Phe Asp Ala Asn Gin Pro Glu Gly Arg 

9 J D ^ M ^ 



540 



Lys cys Lys Leu Lys Pro Gly Lys Gin Cys Ser Pro Ser Gin Gly Pro 



555 



560 



cys Cys Thr Ala Gin Cys Ala Phe Lys Ser Lys Ser Glu Lys Cys Arg 



570 

Asp Asp Ser Asp Cys Ala Arg Glu Gly He Cys Asn Gly Phe Thr Ala 
3 q 0 • — • 



585 



590 



Leu Cys Pro Ala Ser Asp Pro Lys Pro Asn Phe Thr Asp Cys Asn Arg 



600 605 
His Thr Gin Val Cys He Asn Gly Gin Cys Ala Gly Ser He Cys Glu 



615 620 
Lys Tyr Gly Leu Glu Glu Cys Thr Cys Ala Ser Ser Asp Gly Lys Asp 



635 



640 



Aap Lys Glu Leu Cys His Val Cys Cys Met Lys Lys Met Asp Pro Ser 
645 650 



655 



Gly 



Thr Cys Ala Ser Thr Gly Ser Val Gin trp Ser Arg His Phe Ser 

665 670 

Arg Thr lie Thr Leu Gin Pro Gly Ser Pro Cys Asn Asp Phe Arg Gly 



680 



685 



Tyr Cys Asp Val Phe Met Arg Cys Arg Leu Val Asp Ala Asp Gly Pro 



695 



700 



Leu Ala Arg Leu Lys Lys Ala He Phe Ser Pro Glu Leu Tyr Glu Asn 



710 



715 



720 



lie Ala Glu Trp He Val Ala His Trp Trp Ala Val Leu Leu Met 



725 



730 



735 



Gly 



He Ala Leu He Met Leu Met Ala Gly Phe He Lys He Cys Ser Val 
740 745 750 

His Thr Pro Ser Ser Asn Pro Lys Leu Pro Pro Pro Lys Pro Leu Pro 

^55 760 765 

Gly Thr Leu Lys Arg Arg Arg Pro Pro Gin Pro He Gin Gin Pro Gin 
'70 775 

Arg Gin Arg Prtf Arg Glu Ser Tyr Gin Met Gly His Met Arg Arg 



790 



795 



wo 97/31931 



PUi/uSy7/wii/ 
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WHAT IS CLAIMED IS: 



1. A DNA fragment encoding a human disintegrin expressed differentially during 
arthritis development, capable as being used as a screen disintegrin antagonism, drug 
design and screening. 

2. A human disintegrin according to Claim 1 of a molecular weight, and solubility 
useful as a drug screening agent. 

3. A human disintegrin according to Claim 1 in essentially pure form. 

4. A screening method for compounds capable of binding to a human disintegrin, 
comprising the disintegrin of Claim I. 

5. A screening kit for compounds capable of binding to a human disintq^ 
comprising the di»ntegrin of Claim 1 . 

6. An antibody, or fragment thereof, to human disintegrin of Claim 1 . 

7. A screening method for a metalloprotease mediated disease comprising the 
administration of an antibody according to Claim 6 and observing its effect. 

8. A screening method for osteoarthritis comprisung the administration of an 
antibody according to Claim 7 and observing its effect. 

9. A screening method for osteoarthritis according to Claim 7, where blood, synovial 
fluid or other body fluids are screened. 

10. A screening, kit for osteoarthritis comprising an antibody, or fragment thereof, to 
human disintegrin of Claim 6. 

11. A screeiung method, according to Claim 4, useful in detennining relative potency 
in treating osteoarthritis. 
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12. DNA encoding the disintegrin of Claim I (Seq ID NO 2). 

13. An expression vector or plasmid comprising the DNA of Claim 12. 

14. A ceil comprising the DNA of Claim 12. 

15.. A cdl comprising the expression vector or plasnud of Claim 1 3 . 

16. The cell of Claim 14 where the DNA is foreign to that cell. 

17. An inhibitor of the human disintegrin of Claim 2. 

18. A method of treating a disease state associated with disintegrin activity. 

19. The disintegrin of Claim 2, wherdn the disintegrin is aggrocanase. 

20. A method of treating a disease state according to Claim 18 wherein the disease is 
an arthropothy. 

21 . A method according to Claim 20, wherein the disease is osteoarthritis. 

22. The disintegrin of Claim 2, wherein the disintegrin modulates tissue remodeling or 
breakdown. 
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